A lexicon-driven approach for optimal segment combination in off-line recognition of unconstrained handwritten Korean words

Authors

  • Soo-Hyung Kim
  • S. Jeong
  • Ching Y. Suen
Abstract

We propose a new method for off-line recognition of unconstrained handwritten words consisting of Korean and numeric characters. To overcome the difficulty in separating touching characters, we adopt an over-segmentation strategy. Given a slice of the input word image, we find the optimal segment combination using a lexicon-driven word scoring technique and a nearest-neighbor classifier. The optimal combination gives the final segmentation positions for individual characters, along with the best matching word in the lexicon. Superiority of the proposed system has been proven by testing it with 908 images of unconstrained words handwritten on live mail pieces. © 2001 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
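The idea of choosing an optimal segment combination against a lexicon can be illustrated with a small dynamic-programming sketch. This is not the authors' implementation: the function names `match_word` and `recognize`, the `max_merge` limit, the use of a summed cost as the word score, and the toy cost function are all assumptions made for illustration. In a real system the cost of assigning a group of segments to a character would presumably come from the nearest-neighbor classifier's distance on image features.

```python
from typing import Callable, List, Sequence, Tuple

INF = float("inf")

# Hypothetical signature: cost of reading a group of adjacent segments as one character.
CharCost = Callable[[Sequence[str], str], float]


def match_word(segments: Sequence[str], word: str, char_cost: CharCost,
               max_merge: int = 3) -> Tuple[float, List[int]]:
    """Return (total cost, cut positions) for the best way to split all
    segments into len(word) consecutive groups, one group per character."""
    n, m = len(segments), len(word)
    # best[i][j]: minimal cost of explaining the first j segments
    # with the first i characters of the word.
    best = [[INF] * (n + 1) for _ in range(m + 1)]
    back = [[0] * (n + 1) for _ in range(m + 1)]
    best[0][0] = 0.0
    for i in range(1, m + 1):
        for j in range(i, n + 1):
            # Character i absorbs the last k segments (assumed cap: max_merge).
            for k in range(1, min(max_merge, j) + 1):
                prev = best[i - 1][j - k]
                if prev == INF:
                    continue
                cost = prev + char_cost(segments[j - k:j], word[i - 1])
                if cost < best[i][j]:
                    best[i][j] = cost
                    back[i][j] = k
    if best[m][n] == INF:
        return INF, []
    # Trace back the segment index at which each character ends.
    cuts: List[int] = []
    j = n
    for i in range(m, 0, -1):
        cuts.append(j)
        j -= back[i][j]
    cuts.reverse()
    return best[m][n], cuts


def recognize(segments: Sequence[str], lexicon: Sequence[str], char_cost: CharCost,
              max_merge: int = 3) -> Tuple[str, float, List[int]]:
    """Score every lexicon word and return the cheapest (word, cost, cuts)."""
    results = [(w, *match_word(segments, w, char_cost, max_merge)) for w in lexicon]
    return min(results, key=lambda r: r[1])


# Toy stand-in for a classifier: a group matches a character only if the
# concatenated segment labels equal that character's (made-up) template.
TEMPLATES = {"A": "a1a2", "B": "b1", "C": "c1c2"}


def toy_cost(group: Sequence[str], ch: str) -> float:
    return 0.0 if "".join(group) == TEMPLATES[ch] else 1.0


if __name__ == "__main__":
    segs = ["a1", "a2", "b1", "c1", "c2"]
    print(recognize(segs, ["AB", "ABC", "CB"], toy_cost))
```

In this sketch the cut positions returned with the winning word play the role of the final character segmentation positions mentioned in the abstract; how the paper actually normalizes or combines character scores is not stated here.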

Similar resources

A Lexicon Driven Approach for Off-line Recognition of Unconstrained Handwritten Korean Words

We propose a new method for the recognition of unconstrained handwritten words consisting of Korean and numeric characters. To overcome the difficulty in separating touching characters, we adopt an oversegmentation technique and we find the optimal segment combination using a lexicon-driven word scoring technique and a nearest neighbor classifier. The optimal combination gives the final segment...

An HMM-Based Approach for Off-Line Unconstrained Handwritten Word Modeling and Recognition

This paper describes a hidden Markov model-based approach designed to recognize off-line unconstrained handwritten words for large vocabularies. After preprocessing, a word image is segmented into letters or pseudoletters and represented by two feature sequences of equal length, each consisting of an alternating sequence of shape-symbols and segmentation-symbols, which are both explicitly model...

A Two-Step Method for Recognition of Handwritten Farsi Words Using Adaptive Blocking of the Image Gradient

This paper presents a two-step method for off-line handwritten Farsi word recognition. In the first step, in order to improve recognition accuracy and speed, an algorithm is proposed for initially eliminating lexicon entries unlikely to match the input image. For lexicon reduction, the words of the lexicon are clustered using the ISOCLUS and hierarchical clustering algorithms. Clustering is based on the featur...

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

To facilitate the entry of data into the computer and its digitization, automatic recognition of printed texts and manuscripts is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

A Lexicon Driven Method for Unconstrained Bangla Handwritten Word Recognition

In this paper, a lexicon-driven segmentation-recognition scheme for unconstrained Bangla handwritten word recognition is proposed for Indian postal automation. In the proposed method, the input document is first binarized and slant correction of the individual words is performed. Next, using the water reservoir concept, words are pre-segmented into possible primitive components (charact...

Journal:
  • Pattern Recognition

Volume 34

Publication date: 2001